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Metrics in the Space of High Order Networks 

Weiyu Huang and Alejandro Ribeiro 


Abstract —This paper presents methods to compare high order 
networks, defined as weighted complete hypergraphs collecting 
relationship functions between elements of tuples. They can be 
considered as generalizations of conventional networks where 
only relationship functions between pairs are defined. Important 
properties between relationships of tuples of different lengths are 
established, particularly when relationships encode dissimilarities 
or proximities between nodes. Two families of distances are then 
introduced in the space of high order networks. The distances 
measure differences between networks. We prove that they are 
valid metrics in the spaces of high order dissimilarity and 
proximity networks modulo permutation isomorphisms. Practical 
implications are explored by comparing the coauthorship net¬ 
works of two popular signal processing researchers. The metrics 
succeed in identifying their respective collaboration patterns. 

I. Introduction 

We consider high order networks that describe relationships 
between elements of tuples and address the problem of con¬ 
structing valid metric distances between them. Most often, 
networks are defined as structures that describe interactions 
between pairs of nodes 121, Q. This is an indisputable appro¬ 
priate model for networks that describe binary relationships, 
such as communication or influence, but not so appropriate 
for problems in which binary, ternary, or n-ary relationships 
in general, have different implications. This is, e.g., true of 
coauthorship networks where we count the number of joint 
publications by groups of scholars. Papers written by parrs 
of authors capture information that can be used to identify 
important authors and study mores of research communities. 
However, there is extra information to be gleaned from col¬ 
laborations between triplets of authors, or even single author 
publications. The importance of capturing tuple proximities 
between groups of nodes other than pairs has been recognized 
and exploited in multiple domains including coverage analysis 
in sensor networks 11-0, cognitive learning and memory 
0, broadcasting in wireless networks H, image ranking l9ll , 
three-dimensional object retrieval and recognition ifTOl , and 
group relationship structure in social networks d. 

The problem of defining distances between networks, or, 
more loosely, the problem of determining if two networks 
are similar or not, is important even in the case of pair¬ 
wise networks. The problem is not complicated if nodes 
have equal labels in both networks ifT^ - lfTSll . The problem, 
however, becomes very challenging if a common labeling 
doesn’t exist in both networks, as we need to consider all 
possible mappings between nodes of each network. This 
complexity has motivated the use of network features as 
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alternatives to the use of distances. Examples of features 
that have proved useful in particular settings are clustering 
coefficients M, neighborhood topology ini, betweenness 
m, motifs El, wavelets ll20ll . as well as graphlet degree 
distributions or signatures ED-ES). Although feature analysis 
is often effective, it is application-dependent, utilizes only a 
small portion of the information conveyed by the networks, 
and networks not isomorphic may still have zero dissimilarity 
as measured by features. These drawbacks can be overcome 
with the definition of valid metric distances that are universal, 
depend on all edge weights, and are null if and only if the 
networks are isomorphic E4l . We point out that one can think 
of defining distances between networks as a generalization of 
the graph isomorphism problem E5l where the question asked 
is whether two networks are the same or not. When defining 
network distances we also want a measure of how far the 
networks are and we want these measures to be symmetric 
and satisfy the triangle inequality El- 

The main problem addressed in this paper is the construction 
of metric distances between high order networks. Formal defi¬ 
nitions of high order networks are presented (Section HIHi as a 
generalization of pairwise networks (Section in|. Dissimilarity 
networks (Section IIVI) and proximity networks (Section [V]) 
are specific high order networks where relationship functions 
are intended to encode dissimilarities or proximities between 
members of tuples. Dissimilarity networks are characterized by 
the order increasing property which states that tuples become 
more dissimilar when members are added to a group. Prox¬ 
imity networks abide to the order decreasing property which 
states that tuples becomes less similar when adding nodes to 
the group. Two families of proper metric distances are then 
defined in the respective space of dissimilarity (Section flV-AI) 
and proximity (Section IV-AI) networks modulo permutation 
isomorphisms. These distances are built as generalizations of 
the pairwise distances in EH, which are themselves gener¬ 
alizations of the Gromov-Hausdorff distance between metric 
spaces EH, E3- The paper also establishes a duality between 
dissimilarity and proximity networks and the different metrics 
(Section rV-BI) . We use the proximity network distances defined 
in the paper to compare the coauthorship networks of two 
popular signal processing researchers and show that they 
succeed in discriminating their collaboration patterns (Section 
Ell. As in the case of pairwise networks these distances can be 
computed only when the number of nodes is small. Ongoing 
work is focused on the problem of finding bounds on these 
network distances that are computable in networks with large 
numbers of nodes. 

II. Pairwise Networks 

Conventionally, a network is defined as a pair Nx = 
(Vr],), where A is a finite set of nodes and = 
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X X X —7> M+ is a function that may encode similarity or 
dissimilarity between elements. For points x, x' G X, values 
of this function are denoted as r\^{x,x'). We assume that 
r]^{x,x') = 0 if and only if x = a:' and we further restrict 
attention to symmetric networks where rj^{x,x') = r\r{x\x) 
for all pairs of nodes x, x' G X. The set of all such networks 
is denoted as M. 

When defining a distance between networks we need to 
take into consideration that permutations of nodes amount to 
relabelling nodes and should be considered as same entities. 
We therefore say that two networks Nx = {X, and Ny = 
(y, Ty) are isomorphic whenever there exists a bijection (p : 
X —5> y such that for all points x, x' G X, 

rj^(x,x') = r^((p(x),p(x')). (1) 

Such a map is called an isometry. Since the map p is bijective, 
O can only be satisfied when X is a permutation of Y. 
When networks are isomorphic we write Nx = Ny. The 
space of networks where isomorphic networks Nx — Ny are 
represented by the same element is termed the set of networks 
modulo isomorphism and denoted by JV mod =. The space 
JV mod = can be endowed with a valid metric ll24l . The 
definition of this distance requires introducing the prerequisite 
notion of correspondence ll^ Def. 7.3.17]. 

Definition 1 A correspondence between two sets X and Y is 
a subset C Q X x Y such that y x G X, there exists y GY 
such that (x, y) G C and \/ y G Y there exists x G X such 
that (x, y) G C. The set of all correspondences between X 
and Y is denoted as C{X, Y). 

A correspondence in the sense of Definition [T] is a map be¬ 
tween node sets X and Y so that every element of each set has 
at least one correspondent in the other set. Correspondences 
include permutations as particular cases but also allow for the 
mapping of a single point in X to multiple correspondents 
in y or, vice versa. Most importantly, this allows definition 
of correspondences between networks with different numbers 
of elements. We can now define the distance between two 
networks by selecting the correspondence that makes them 
most similar as we formally define next. 

Definition 2 Given two networks Nx = {X^r\f) and Ny = 
(y, Ty) and a correspondence C between the node spaces X 
and Y define the network difference with respect to C as 

^x,y{C):= max (xi,xa) - r^(j/i, t/ 2 )| • (2) 

{xi,yx),{x2,y2)&C 

The network distance between networks Nx and Ny is then 
defined as 

d\f{Nx,Ny) := {rx,y(C')|. (3) 

For a given correspondence C G C{X,Y) the network 
difference F^^ y(C') selects the maximum distance difference 
|r^ (xi, xa) — Xy (i/i, 2 / 2 ) I among all pairs of correspondents - 
we compare r]^{xi,X 2 ) with Xy(j/i,?/ 2 ) when the points xi 
and yi, as well as the points xa and j/a, are correspondents. 
The distance in 0 is defined by selecting the correspondence 


that minimizes these maximal differences. The distance in 
Definition |2] is a proper metric in the space of networks 
modulo isomorphism. It is nonnegative, symmetric, satisfies 
the triangle inequality, and is null if and only if the networks 
are isomorphic 1241. For future reference, the notions of metric 
and pseudometric are formally stated next. 

Definition 3 Given a space S and an isomorphism =, a 
function d : S X S ^ ^ is a metric in S mod = if for 
any a,b,c G S the function d satisfies: 

(i) Nonnegativity. d{a,b) > 0. 

(ii) Symmetry. d{a,b) = d{b,a). 

(Hi) Identity. d(a, b) = 0 if and only if a = b. 

(iv) Triangle inequality. (i(a, b) < d{a, c) + d{c, b). 

The function is a pseudometric in S mod = if for any 
a,b,cG S the function d satisfies (i), (ii), (iv), and 

(in’) Relaxed identity. d{a, b) =0 if a = b. 

A metric d in <S mod = gives a proper notion of distance. 
Since zero distances imply elements being isomorphic, the 
distance between elements reflects how far they are from 
being isomorphic. Pseudometrics are relaxed since elements 
not isomorphic may still have zero distance measured by 
the pseudometrics. The distance in Definition |2] is a metric 
in space JV mod =. Observe that since correspondences 
may be between networks with different number of elements. 
Definition 12] defines a distance d\f{Nx, Ny) when the node 
cardinalities |X| and |y| are different. In the particular case 
when the functions satisfy the triangle inequality, the set 
of networks JV reduces to the set of metric spaces Ai. In 
this case the metric in Definition |2] reduces to the Gromov- 
Hausdorff (GFI) distance between metric spaces. The distances 
d\f{Nx, Ny) in 0 are valid metrics even if the triangle 
inequalities are violated by or Xy ll24l . 

In this paper we consider high order networks where the 
specification of functions x^ : —>• Ry are meant to 

encode similarities or dissimilarities between node {k + 1)- 
tuples. The goal of this paper is to devise generalizations 
of Definition |2] to high order networks and to prove that 
they define valid metrics in the space of high order networks 
modulo isomorphism; see Definitions [H] [T210 and [15] 

III. High Order Networks 

A network of order K over the node space X is defined as 
a collection of K + 1 relationship functions {x^ : X^'^^ —> 
from the space of {k + l)-tuples to the 

nonnegative reals, 

= {^NxNx, ■ ■ ■ Nx) ■ ( 4 ) 

A network of order K can be considered as a weighted com¬ 
plete hypergraph li29l , li^ whose weights for all hyperedges 
of elements of all (/c -f 1) tuples with 0 < fc < AT are defined. 

When some nodes are repeated in the point collection 
xo-.k ■= (xo, xij ■ ■ ■ Nk) G X^^^, the relationship function 
Xx(xo:k) entails the same information as the relationship 
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function between the largest non-repeating subtuple of XQ-,k. In 
future definitions, it would be important to take the number of 
distinct elements of a tuple into consideration. We formalize 
this property by introducing the notion of the rank of tuples 
as we formally specify next. 

Definition 4 The rank s(xo:k) of a given tuple xo-.k is the 
number of unique elements in the tuple. 

It follows from Definition |4] that the rank s{x,x) = 1 
and that the rank s{x',x,x') = 2. Moreover, the relationship 
function between a tuple XQ-,k is identical to the relationship 
functions of subtuples of xo-,k that have same rank as s{xo:k) 
since they imply same information. This remark along with 
a symmetry property makes up the formal definition of high 
order networks that we introduce next. 

Definition 5 = (X, rx,rx, ■ ■ ■, rx) is a K-order net¬ 

work if the following two properties holds: 

Symmetry. For any 0 < k < K and any point collections 
xo:k, have that 

rxix[o:k]) = rxi.xo-k), (5) 

where X[o:fc] = ([xq], [a^i], • ■ •, [xk]) is a reordering of xo-.k '■= 
. . .,Xk). 

Identity. For any 0 < k < K and tuple xg-.k, ony of its 
subtuple xig-,ij^ with s{xo-,k) = s{xig-,i^) satisfies 

rxixo-.k) =r'f{xi^:i-J. (6) 

The set of all high order networks of order K is denoted as 

For point collections X[)-k, values of their fc-order relation¬ 
ship functions are denoted as (xqi/c) and are intended to rep¬ 
resent a measure of similarity or dissimilarity for members of 
the group. In particular, the zeroth order function r\ encodes 
relative weights of different nodes and the first order function 
represents the pairwise information discussed in Section 
m Observe however that pairwise networks are not particular 
cases of networks of order 1 because a network of order K not 
only requires the definition of relationships between {K -f 1)- 
tuples but also of relationships between (fc -I- l)-tuples for all 
integers 0 < k < K. A network of order 0 is one in which only 
node weights are given, a network of order 1 is one in which 
weights and pairwise relationships are defined, a network 
of order 2 adds relationships between triplets and so on. 
Examples for the identity property includes r\{x, x) = r\r{x) 
and r\{x',x,x') = r\{x,x'). We assume that relationship 
values are normalized so that 0 < r\{xo:k) < 1 for all k 
and xo:fe. As in the case of pairwise networks we consider 
AT-order networks Nx and to be equivalent for their k- 
order relationship functions if is a permutation of Vy as 
we formally dehne next. 

Definition 6 We say that two networks and are k- 
isomorphic if there exists a bijection : X ^ Y such that for 
all xo:k e we have 


where we use the shorthand notation Xy((/)(xo:fe)) := 

Ty ((^(xq), ..., (/)(xfe)). The map is called a k- 

isometry. 

When networks and Ny are fc-isomorphic we write 
Nx =k Ny . The space of AT-order networks modulo k- 
isomorphism is denoted by ff^ mod =k. For each nonnega¬ 
tive integer 0 < k < K, the space Af^ mod =k of networks 
of order K modulo fc-isomorphism can be endowed with a 
pseudometric. The definition of this family of pseudometrics 
is a generalization of Definition |2] as we formally state next. 


Definition 7 Given networks and Ny , a correspondence 
C between the node spaces X and Y, and an integer 0 < k < 
K define the k-order network difference with respect to C as 


rx.'v(C') max 

G0:k,y0:k)£C 

\r\ (xo:fc) 

-ry{yo,k) \ , 

(8) 

where the notation 

(^0:fc5 yO:k) 

stands 

for 


(s^Oj t/o)) (xi^yi ),..., {xk,yk)- The k-order network distance 
between networks N^ and Ny is then defined as 

NP) := ^^mm^^ {r^,y(C)} . (9) 

We further define the K-order distance vector as 
the AT -f 1 dimensional vector dj^(iVf,7Vf) = 

IfPx-iNx , Ny),..., d^{Nx , Ny)] that groups the 
k-order distances in ®. 

Both, Definition |2] and Definition |7] consider correspon¬ 
dences C that map the node space X onto the node space 
Y, compare dissimilarities, and set the network distance to 
the comparison that yields the smallest value in terms of 
maximum differences. The distinction between them is that 
in (|2|l we compare the values in r^(xi,X 2 ) and ry(?/i, 2 / 2 ), 
whereas in (|8]l we compare the values in each of the k- 
order relationships r\{x[)-k) and ry{y[).k) to compute the k- 
order distances d^{Nx, Ny) that we group in the vector 
d^{Nx , Ny). Except for this distinction. Definition |2] and 
Definition |7] are analogous since r^y(C') selects the max¬ 
imum fc-order relationship difference |?’^(xo:fc) — Xy(j/o:fe)| 
among all tuples of correspondents - we compare r^(xo:fe) 
with ry(j/o:fc) when all the points xi G xo-.k and yi G yo-k 
are correspondents. The distance d^{Nx , XP) is defined by 
selecting the correspondence that minimizes these maximal 
differences. 

Notice that, in general, the correspondence C minimizing 
r^y(C') is not necessarily identical to the correspondence 
C minimizing T^xYpd') foJ" k f 1. The distance vector 
dPf is a vector with each element measuring the dissimilarity 
between relationship functions of a specific order, possibly 
using different minimizing correspondences. We emphasize 
that, as in the case of Definition m d%{Np,NP) and 
are defined even if the numbers of nodes in X 
and Y are different. We show in the following proposition that 
the function d^ : xAf^ -G R+ is, indeed, a pseudometric 

in the space of AT-order networks modulo fc-isomorphism for 
any integer 0 < fc < a:. 


ryifiixo-.k)) = rx{xo:k), 


(7) 


define the network difference with respect to C as 
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Fig. 1. An example of two networks being not 1-isomorphic but having 
zero 1-order network distance between them. For the given correspondence 
C, rj^ixi,X2) = r^(yi,y2), rj^ixi.xs) = r^(yi,y2)- rj^(x2,X3) = 
rY{y 2 ,y 2 ) = ^^(^ 2 ) where the second equality follows from the identity 
property. Moreover, rj^(xi,xi) = Vyivi^yi), rj^ix 2 ,X 2 ) = r^(y 2 ,y 2 ), 
~ rY{y 2 ,y 2 )- = 0 witnesses the zero 1-order 

network distance between Nj^ and However these networks cannot be 
1-isomoiphic since they possess different number of nodes. 


Proposition 1 Given any nonnegative integer K, for any 
integers {) < k < K, the function d^ : J\f^ x IR.+ 

defined in (O is a pseudometric in the space mod =fc. 

Proof: See Appendix lAl ■ 

being a pseudometric implies that two high order net¬ 
works not fc-isomorphic may still have zero fc-order network 
distance between them. A specific example can be found in 
Figure [T] where two 1 -order networks not 1 -isomorphic have 
zero dissimilarity measured by the 1-order network distance. 
For each integer 0 < k < K, the pseudometric 
dehned in Dehnition |7] in the space mod =k measures 
dissimilarity between fc-order functions and Vy- We can 
also ask the question of how different two networks are by 
considering all their order functions. To that end we consider 
AT-order networks to be equivalent if is a permutation of 
for all integers 0 < fc < iT as we formally state next. 

Definition 8 "We say that two networks of order K, and 
Ny, are isomorphic if there exists a bijection (j) : X ^ Y 
such that (|7]l holds for all 0 < k < K and x^.k G X^^^. The 
map (j) is called an isometry. 

When networks and Ny are isomorphic we write 
= Ny. The difference between /c-isomorphism and 
isomorphism is that the bijection in the latter case preserves 
relationship functions over all orders whereas only fc-order 
relationship functions are preserved in the former case. That 
= Ny implies that =k Ny for all integers 

0 < k < K, but the opposite is not necessarily true. 

The space of AT-order networks modulo isomorphism is de¬ 
noted as mod =. A family of pseudometrics measuring 
the difference between networks over all order functions as 
a whole can be endowed in the space mod =. The 

dehnition of this family of distances can be considered as an 
extension of Dehnition |2] and an aggregation of Dehnition [T] 
as we formally state next. 


Dehnition 9 Given networks and Ny, a correspondence 
C between the node spaces X and Y, and some p-norm || • ||p. 




ffx,YiC),ryyic),...,rly 


P 

( 10 ) 


where for each integer 0 < k < K, F^ y iC) is the k-order 
network difference with respect to C defined in ®. The p- 
norm network distance between and Ny is then defined 
as 

dj^,piN^,NP) := ^^min^^ { ||rf .r(^^)|| J • dD 

The difference between Dehnition |2] Dehnition [T] and 
Dehnition |9] is that in the case of the network distance 
dMAN^.XP), we compare not only relationship functions 
I'xi^o-.k) and ry{yQ.k) but also all the relationship functions 
of order not larger than K. The norm over the vector y (C) 
formed by fc-order network differences with respect to C 
for all integers 0 < k < K is assigned as the difference 
between and Ny measured by the correspondence C. 
The distance d^f^p{N^,N y) is then dehned as the minimum 
of these differences achieved by some correspondence. As in 
the cases of Dehnition |2] and Dehnition |7] d^ANx^N^) is 
dehned even if the numbers of nodes in X and Y are different. 
The function d^-^p : x K+ is a pseudometric in 

the space of A'-order networks modulo isomorphism as we 
show in the following proposition. 


Proposition 2 Given some p-norm || • \\p, for any nonnegative 
integer K the function dj\f^p : J\f^ x —>■ K+ defined in 
(ED is a pseudometric in the space mod =. 

Proof: See Appendix iBl ■ 

Observe that in (ED we are only allowed to pick one 
correspondence minimizing ||r^y(C')||p whereas in (|9D for 
each k we are able to pick one correspondence minimizing 
the order specihc T\y{C). This establishes a relationship 
between djv'.p and ||d_^||p that we show next. 

Proposition 3 Given some p-norm || • \\p, for any nonnegative 
integer K the function dx'^p defined in (ED is no smaller 
than IIlip where d^ is the vector of distances defined in 
Definition^ I.e., for any pair of K-order networks N^, Ny , 
we have that 

dA^.p(fVf,iVf)>||d^(iVf,iVf)||^. (12) 

Proof: Given AT-order networks N^^Ny, a correspondence 
C between the node spaces X and Y, and an integer 0 < k < 
K, it follows from © that 

rly(C)>d^(iVf,iVf). (13) 

This implies that the vector d^{N^, Ny ) is element-wise no 

greater than y (C) from where it follows that 

||rf,y(c)||^> ||d^(ivf,fvf)||^. (14) 

Since (ED applies for any correspondence C, the mini¬ 

mum of ||r^y(C)||^ achieved by some correspondence in 
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the set of correspondence C(X,Y) is still no smaller than 

{l|riV(C)|l,} > l|di5(JVf (15) 

The result in (fTSl i follows after noting that the minimum in 
the left hand side of (fTsT i is the distance dj^,p{N^, Ny) in 

(HB. ■ 

Definitions |7] and |9] are pseudometrics in the space of 
high order networks modulo appropriate isomorphisms. To 
obtain proper metrics, we restrict attention to subclasses of 
networks having specific structures. To do so, observe that the 
fc-order function of a given network does not impose 
constraints on the Torder function of the same network 
except the identity property. In practical situations, however, 
it is common to observe that adding nodes to a tuple results in 
either increasing or decreasing relationships between elements 
of the extended tuple. This motivates the consideration of dis¬ 
similarity networks and proximity networks that we undertake 
in the next two sections. 

IV. Dissimilarity Networks 

In dissimilarity networks the function r\{x{)-k) encodes a 
level of dissimilarity between elements of the Xo:k tuple. In 
this scenario it is reasonable to assume that adding elements to 
a tuple makes the group more dissimilar. This restriction along 
with a generalization of the requirement that r\^{x, x') = 0 if 
and only \f x = x' in pairwise network makes up the formal 
definition that we introducre next. 

Definition 10 We say that the K-order network = 

i^X, ..., is a dissimilarity network if for any order 

0 < k < K and tuples xw.k S its relationship 

function is the summation of a dissimilarity function and the 
multiplication of its rank with a small constant e, 

rxi^o-.k) = dx{xo:k) + es(xo:k) (16) 

The dissimilarity terms satisfy the order increasing property 
so that for any 1 < k < K and Xg-.k, 

dx{xo:k)>d!"£^{xo:k-l), (17) 

and the constant e > 0 is a strictly positive value that satisfies 

0 < e < 1 — max dx{xo K)- (18) 

K XQ,K&X« + ^ 

The set of all dissimilarity networks of order K is denoted as 
V^. 

To see that the order increasing property (fTTI i in Definition 
ITOl is reasonable consider a network describing the temporal 
dynamics of the formation of a research community - see 
Figure |2] The dissimilarity term in the fc-order relationship 
function in this network marks the normalized time instant 
at which members of a given {k -f 1)-tuple write their first 
joint paper. In particular, the zeroth order dissimilarities dPx 
are the normalized time instants when authors publish their 
first paper. In Figure |2] authors A, B, C, and D publish their 
first papers at times 0, 1/9, 5/9, and 3/9. The first order 



Fig. 2. Temporal dynamics for the formation of a research community. 
The /c-order relationship function in this 2-order dissimilarity network [cf. 
Definition ESI incorporates the dissimilarity function - the normalized time 
instant at which members of a given (k + 1)-tuple write their first joint paper 
- and the multiplication of e with the rank of the tuple. E.g., A writes her 
first paper at time 0, and coauthors with B, D, and C at times 2/9, 4/9, and 
5/9. She also writes jointly with B and D at time 4/9. 

dissimilarities c?^ between pairs denote the normalized times 
at which nodes become coauthors. Since authors can’t become 
coauthors until after they write their first paper it is certain that 
d\{x,x') > dx{x) and d\{x,x') > dx{x') for all x and x'. 
In Figure |2l A and B become coauthors at time 2/9, which 
occurs after they publish their respective first papers at times 
0 and 1/9. Authors A and D as well as B and D become 
coauthors at time 4/9, A and C become coauthors at time 
5/9. Authors C and D never write a paper together. 

Second order dissimilarities d\ for triplets denote the 
normalized time at which a paper is coauthored by the three 
members of the triplet. Since a paper can’t be coauthored by 
three people without being at the same time coauthored by 
each of the three possible pairs of authors we must have that 
dx{x,x',x") > dx{x,x'), dx{x,x',x'') > dx{x,x''), and 
dfx\x,x',x") > dx{x',x'') for all x, x', and x". In Figure |2] 
authors A, B, and D publish a joint paper at time 4/9, which is 
no smaller than the pairwise coauthorship times between each 
two of the individual authors. Authors A, B, and C publish a 
joint paper at time 8/9, which is a time that comes after the 
individual paired publications that occur at times 2/9, 5/9, 
and 7/9. Note that due to symmetry property a relationship as 
in (fTTI) holds if we remove an arbitrary node from the tuple 
XQ-.k, not necessarily the last. 

In pairwise dissimilarity networks we required dx{x, x') = 
0 if and only if x = x'. Relationships between two different 
nodes are strictly greater than relationships between two nodes 
that are actually identical. The multiplication of e and the 
rank of the tuples in (IThl) in Definition [TO] can be considered 
as a generalization. Consider tuples xo-,k and (xg-.k-i, Xg) 
where every node in xg-.k is unique, the identity property for 
high order networks forces rx(xg:k-i, Xg) = r^^(xg:k-i)- 
We must then have the relationship between fc -f 1 different 
elements rx(xg-k) being strictly greater than the relationship 
between k different elements r\{xg-k-i,xg) = {xg-,k-i)- 

This is because d\{xg.,k) > ^{^ 0 :fe-i) follows from (fTTI) 

and €s{xg-k) = {k l)e > ke = es{xg-,k-i) follows from the 
definition of ranks. Therefore, the multiplication of e and the 
rank of tuples in (fThl) in DefinitionfTOlforces that adding a new 
element to a tuple makes the set strictly more dissimilar than it 
was. Or equivalently, removing an element from a tuple makes 
the set strictly less dissimilar than it was. The requirement 
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for e as in (fTSl l ensures that the highest relationship in 
the network maxjg.^gjfic+i d^{xo-,K) + €s{xo-,k) is bounded 
above by 1. The rank correction term es{xo:k) is a technical 
modification to distinguish between full rank (proper) fc-tuples 
and rank deficient (degenerate) tuples. In practice it can be 
set to a sufficiently small value compared to dissimilarities or 
completely ignored. Since distances up to order 2 are defined 
and relationship functions can be decomposed, the network in 
Figure 12] is a dissimilarity network of order 2. 

A. Metrics in the space of dissimilarity networks 

When the input networks in Definition |7] are dissimilarity 
networks we refer to the fc-order distance as the fc-order 
dissimilarity network distance. We state this formally in the 
following definition for future reference. 

Definition 11 Given dissimilarity networks D^^Dy € 
we say that the k-order distance d^{D^,DY) = 

d^{D^, Dy) of Definition [7| is the k-order dissimilarity 
network distance between and Dy. 

Since C , the function x — >• IR+ is a 

pseudometric in the space of iT-order dissimilarity networks 
modulo /c-isomorphism. The restriction, however, makes 
not only a pseudometric but a well-defined metric in the space 
mod =k of dissimilarity networks of order K modulo 
fc-isomorphism. We show this in the following theorem. 

Theorem 1 The k-order dissimilarity network distance func¬ 
tion df^ : X > ]R_|_ of Definition [77] is a metric in the 

space D^ mod =k for all 1 < k < K. 

Proof: See Appendix O ■ 

Observe that in Theorem [T] we have that is a proper 
metric for all k other than 0. This caveat for d^ is because 
we may have two dissimilarity networks and Dy with 
different number of nodes but whose zeroth other relationships 
are equals for all pairs of nodes, i.e., r^(a:) = ry(t/) for 
all x € X and y € F. In this case we we would have 
, Dy ) = 0, however the two dissimilarity networks 
are not 0-isomorphic. 

Restricting Definition[9]to dissimilarity networks also yields 
a family of dissimilarity network distances as next. 

Definition 12 Given dissimilarity networks D^^Dy S D^ 
we say that the p-norm network distance dj^^p{D^, Dy) = 
d'D,p{D^^Dy) of Definition ^ is the p-norm dissimilarity 
network distance between and Dy. 

By restricting our attention to dissimilarity networks instead 
of general high order networks, dx),p also becomes a valid 
metric in the space D^ mod = of dissimilarity networks of 
order K > 1 modulo isomorphism as we state in the following 
theorem. 

Theorem 2 Given some p-norm || • ||p, for any nonnegative 
integer K > 1 the function dx>^p : D^ X D^ —^ in 

Definition [72] is a metric in the space D^ mod =. 



Fig. 3. Collaborations between authors in a research community. The fc-order 
relationship function in this 2-order network [cf. Definition [ni incorporates 
the proximity function - the number of publications between members of a 
given {k 1)-tuples normalized by the total number of papers - and the 
multiplication of —e with the rank of the tuple. 

Proof: See Appendix ICl ■ 

Further note that since Proposition [3] holds for any pair of 
networks, the same relationship holds true for the dissimilarity 
network distances in Definitions [TT] and [T2] Observe, however, 
that the norm ||d:^(D^,Oy )|| is not a valid metric because 
we can have instances in which two dissimilarity networks 
are fc-isomorphic for all integers 0 < k < K without being 
isomorphic. 

V. Proximity Networks 

In proximity networks the relationship functions r\{xw.k) 
denote similarity or proximity between elements of a tuple. 
Thus, large values of the proximity function r\(x[)-.k) rep¬ 
resent strong relationship whereas small values denote weak 
relationships - the exact opposite is true of dissimilarity 
networks. In this framework it is reasonable to assume that 
adding elements to a tuple forces the group to be less similar. 
This constraint makes up the formal definition we introduce 
as follows. 

Definition 13 We say that the K-order network = 

is a proximity network if for any order 
0 < k < K and tuples xw.k S its relationship function 

is the summation of a proximity term and the multiplication 
of its rank with —e, 

rx(xo:k) = dx(xo:k) - es(xo:k), (19) 

The proximity terms satisfy the order increasing property that 
for any 1 < k < K and Xw.k, 

Pxixo.k) <p''£^{xo-.k-i), (20) 

and the constant e > 0 is a strictly positive value that satisfies 

0 < e < ^ min pf (21) 

K S:ex^+^ 

The set of all proximity networks of order K is denoted as 

-pK 

To see that the order decreasing property (i20l l in Defi¬ 
nition [T3] is reasonable, consider a network illustrating the 
collaborations between authors in a research community - 
See Figure [3] The fc-order proximity function in this network 
labels the number of publications between members of a given 
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{k + l)-tuple. In specific, the zeroth order proximities are 
the numbers of papers published by authors normalized by 
the total number of papers. In Figure [3 authors A, B, C, D 
publish 11,9, 2, 5 papers respectively and there are 19 papers 
in total which implies p\{A) = 11/19, p^{B) = 9/19, 
p\{C) = 2/19, p\{D) = 5/19. The first order proximities 
p\ represent the number of papers co-published by nodes. 
Since collaboration for a pair of authors is also a paper for 
each of the individuals it is certain that p^{x,x') < Pxi^) 
and px{x,x') < p^ix') for all x and x'. In Figured A and 
B collaborate on 4 papers, which is less than the 11 and 9 
papers written by each of the individuals. Authors A and C 
as well as A and D coauthor 2 papers in total. Authors C and 
D never write a paper together. 

Second order proximities p'j^ for triplets indicate the nor¬ 
malized number of papers coauthored by the three members 
of the triplet. Since a paper with three authors is also a 
collaboration for the three pairs of authors we must have 
p\{x,x',x") < p\{x,x'), p\{x,x',x") < p\{x,x"), and 
Px{x,x',x") < Px{x',x'') for all x, x', and x". In Figured 
authors A, B, and D cowrite 2 papers, which is no more than 
the number of pairwise collaborations between each pair of the 
authors. Remark that symmetry property inherited from high 
order networks [cf. Definition |5l implies (l20l l if we remove an 
arbitrary node from the tuple xo:k, not necessarily the last. 

In dissimilarity networks we required the relationship within 
tuple xo:k of unique elements to be strictly greater than the 
relationship between the point collection (a:o:fe-i,xo) where 
some nodes are repeating. The multiplication of — e and 
ranks in (fT^ in Definition [13] can also be considered as a 
generalization. Following the identity property of high order 
networks, r^(xo:fc-i, xq) = We must then have 

the function between fc -I- 1 different elements r^{xQ-k) being 
strictly smaller than the function between k different elements 
rxixo:k-i, Xq) = This is because in the de¬ 

composition p^(a;o:fc) ^ P^^{xo:k-i) follows from (i20l i and 
—es(xo:k) = —(k -|- l)e < —ke = —es(xo:k-i) follows from 
the definition of ranks. Therefore, the multiplication of —e 
and rank of tuples in ([T9] l in Definition [T0| forces that adding 
a new element to a tuple makes the set strictly less similar 
than it was. Or equivalently, removing an element from a 
tuple makes the set strictly more similar than it was. The 
requirement for e as in (l2lT i ensures that the lowest relationship 
function in the network rninj^ dxixo-.K) — es(io:fe) 

is nonnegative. Again the rank correction term es{xQ-,k) is a 
technical modification and in practice it can be set to suffi¬ 
ciently small compared to proximities or completely ignored. 
Since relationships up to order 2 are defined and can be 
decomposed, the network in Figure [3] is a proximity network 
of order 2. 

A. Metrics in the space of proximity networks 

In the same way that restricting attention to dissimilarity 
networks transforms the pseudometrics in Definitions [7] and 
[9] into metrics, restricting attention to proximity networks 
also results in the definitions of proper metrics. We state 
the restrictions of Definitions |7| and |9] in the following two 
definitions. 


Definition 14 Given proximity networks Px,Py € we 
say that the k-order distance d'^lPx , Py) = dfp{Px ^ Py) of 
Definition^is the k-order proximity network distance between 
P^ and Py. 

Definition 15 Given proximity networks Px^Py € we 
say that the p-norm network distance dx',piPxiPy) = 
d'p^p{Px , Py) of Definition^is the p-norm proximity network 
distance between P^ and Py ■ 

Analogously to the definition of the dissimilarity network 
distance df, of Definition [TT] the function dip : P^ x P^ 

R+ is a proper metric in the space P^ mod =k of proximity 
networks of order K modulo fc-isomorphism for all integers 
1 < k < K. Likewise, restricting the function dx'^p of 
Definition [9] to proximity networks as Definition [15] results 
in dp p being a proper metric. We state these facts in the 
following theorems. 

Theorem 3 The k-order proximity network distance function 
dp : P^ X P^ —>■ R+ of Definition [73 is a metric in the 
space P^ mod =k for all k > 1. 

Theorem 4 Given some p-norm || • ||p, for any nonnegative 
integer K > 1 the function d-p^p : P^ x P^ R+ in 
Definition [73 is a metric in the space P^ mod =. 

Proof: See Appendix iP] ■ 

In Theorem [3 we require fc > 1 for the same reason as in 
Theorem [T] We emphasize that dp is a metric in the space of 
proximity network modulo fc-isomorphisms, whereas d-pp is 
a metric in the space of networks modulo isomorphism. Also 
note that we must have d-p,p{Px , Py) > ||dp (P^, 
as per Proposition [3] but ||dp(P^,P/?)|| is not necessarily 
a metric. 

Remark 1 GH distance is the minimum across correspon¬ 
dences of the maximum difference in distances between pairs 
of nodes for a given correspondence. The metric definitions 
as in Definitions [n] [I2] [H and [15] inherit this property, 
which means that network distances can be dominated by a 
small portion of the networks. Put differently, the proposed 
distances are more sensitive to a few large differences in a few 
edges than to a large number of small differences in a large 
number of edges. Analogous consideration can be found in 
signal processing theory of the tradeoffs between comparing 
signals with averages - such as 2-norm comparisons - and 
comparing signals with max-min differences - the oo-norm 
comparison. When compare networks with different number of 
nodes, a max-min comparison is reasonable because it focuses 
attention in the bottleneck tuple that makes it impossible to 
match smaller network onto the larger. 

Remark 2 Once endowed with the proposed valid metrics as 
in Definitions [TT] [12] [14] and [15] the space of dissimilarity 
networks and the space of proximity networks become metric 
spaces. This implies that a number of algorithms that are used 
to analyze metric spaces can now be used to analyze high 
order networks. 



Fig. 4. Relationships between authors expressed in terms of dissimilarities 
constructed from the proximity network in Figure The fc-order relationship 
function in this 2-order network denotes the level of dissimilarities between 
members of a given (fc -|- l)-tuples. This is a dissimilarity network that has 
same order and identical node sets as the proximity network. 

B. Duality between dissimilarity and proximity networks 

Proximity and dissimilarity networks have been defined 
separately for simplicity of presentation, but they are actually 
related entities. For any proximity network with rela¬ 
tionship functions p\{xQ-,k), we can construct a dissimilarity 
network on the same node space by defining relationships 
as d\{xo:k) = 1 — Px{xo:k) for all orders k and tuples xo-k- 
Likewise given a dissimilarity network with relationship 
functions dx{xo:k) we can construct a proximity network 
Px by defining relationships Pxixo-.k) = 1 — d-xi^o-.k)- We 
formalize this equivalence through the introduction of dual 
networks in the following definition. 

Definition 16 Given a node space X, the K-order proximity 
and dissimilarity networks P^ = (X ,and 

Dx = (^x, dx ,..., dx^ are said duals if and only if 

Px{xO:k) = 1 ~ dx{X0:k), ( 22 ) 

for all orders 0 < k < K and tuples Xo-,k. 

It is ready to see that all proximity networks have a dual 
dissimilarity network and that, conversely, all dissimilarity 
networks have a dual proximity network. To do so we just 
reinterpret (l22l i as a definition and observe that: (i) The decom¬ 
position of relationships in the proximity network implies the 
valid decomposition of relationships in the dual dissimilarity 
network, and vice versa, (ii) The order decreasing property of 
the proximities in the proximity network implies the order in¬ 
creasing property of the dissimilarities in the dual dissimilarity 
network, and vice versa. An illustration for the construction 
of a dual dissimilarity network is presented in Figure |4] where 
we construct the corresponding dual dissimilarity network for 
the coauthorship network considered in Figure [3 

Given dual networks we can compute the distances in 
definitions [14] and [15] for proximity networks and the distances 
in definitions [TT] and [12] for the dual dissimilarity networks. 
These definitions have been constructed so that the resulting 
distances are the same, as we formally state in the following 
proposition. 

Proposition 4 Consider two proximity networks P^ and Py 
and their corresponding dual dissimilarity networks and 


Dy ■ The k-order proximity distances dff{P^,Py) [cf Def- 
inition [7?1? and k-order dissimilarity distances d^{D^, Dy) 
[cf. Definition 1771/ coincide for all 0 < k < K, 

4(Pf,P^) = 4(77f,77|^). (23) 

Likewise, the p-norm proximity distance d-p^pi^Py ,Py) 
[cf. Definition 1751/ and p-norm dissimilarity distance 
dvpiDx, Dy) [cf. Definition 1721/ coincide, 

dv,p{P^,P^) = dv,p{D§,D^). (24) 

Proof: See Appendix |E] ■ 

VI. Comparison of Coauthorship Networks 

We apply the metrics defined in Section IV-AI to compare 
second order coauthorship networks where relationship func¬ 
tions denote the number of publications of single authors, 
pairs of authors, and triplets. These coauthorship networks are 
proximity networks because they satisfy the order decreasing 
property in Definition [T3] Since both. Definition [l4l and Defi¬ 
nition [15] require searching over all possible correspondences 
between the node spaces, we can compute exact distances 
for networks with a small number of nodes only. Thus, we 
consider publications in the IEEE Transactions on Signal 
Processing (TSP) in the last decade but restrict attention to 
the collaboration networks of Prof. Georgios B. Giannakis 
(GG) of the University of Minnesota and Prof. Martin Vetterli 
(MV) of the Ecole Polytechnique Eederale de Lausanne. We 
choose these authors because their collaboration traits are 
more developed and stable and we expect their respective 
collaboration pattern to be steady over the past decade. The 
goal of the simulation is to illustrate that network metrics are 
able to distinguish discernible collaboration patterns. Eor each 
of the authors, GG and MV, we construct networks for the 
2004-2008 and 2009-2013 quinquennia. These networks are 
referred as GG0408, GG0913, MV0408, and MV0913. Eor 
GG we also define networks for each of the biennia 2004- 
2005, 2006-2007, 2008-2009, 2010-2011, and 2012-2013. 
We denote these networks as GG0405, GG0607, GG0809, 
GGlOll, and GG1213. Lists of publications are queried from 

ED. 

Eor each of these authors we consider all of their TSP 
publications in the period of interest and construct proximity 
networks where the node space X is formed by the author and 
the respective set of coauthors. Zeroth order proximities are 
defined as the total number of publications of each member of 
the network, first order proximities as the number of papers 
coauthored by pairs, and second order proximities as the 
number of papers coauthored by triplets. The constant e as 
in Definition [13] is for technical purpose. It can be chosen 
sufficiently small and for this reason we ignore it in this 
section. To make networks with different numbers of papers 
comparable we normalize all distances by the total number 
of papers in the network. With this construction we have that 
the zeroth order proximity of GG or MV are 1 in all of their 
respective networks. There are papers with more than three 
coauthors but we don’t record proximities of order higher than 
2 . 
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Fig. 5. Quinquennial coauthorship networks representing research communities centered at Prof. Georgios Giannakis (GG) or Prof. Martin Vetterli (MV). 
The size of the nodes is proportional to the zeroth order proximities, and the width of the links to the first order proximities. Second order proximities are 
represented by shading the triangle enclosed by the coauthor triplet. Color intensity is proportional to the second order proximities. 




Fig. 6. Biennial coauthorship networks representing research communities centered at Prof. Georgios Giannakis (GG). 
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Fig. 7. Two dimensional Euclidean embeddings of the fc-order proximity network distances d^,d}p,d^ and the proximity network distance with respect 
to the 1-norm, d-p^i, between the quinquennial networks. In the embeddings, denote MV0408, MV0913 as circles, GG0408, GG0913 as diamonds. GG0408 
and GG0913 are colocated regarding d}p,d^. 


The quenquennial networks GG0408, GG0913, MV0408, 
and MV0913 are shown in Figure |3 and the biennial networks 
GG0607, GG0809, GGlOll, and GG1213 in Figure |6] The 
size of the nodes is proportional to the zeroth order distances, 
and the width of the links to the first order distances. Second 
order proximities are represented by shading the triangle 
enclosed by the coauthor triplet and the color intensity is 
proportional to the second order proximities. There are clear 
differences in the collaboration patterns. We show here that 
proximity network distances succeed in identifying these pat¬ 
terns and distinguish between the coauthorship networks of 
GG and MV. 

A. Quinquennial networks 

Two dimensional Euclidean embeddings (respect to mini¬ 
mizing the sum of squares of the interpoint distances) of the 


fc-order proximity network distances dp for k S {0,1,2} and 
the proximity network distance with respect to the 1-norm, 
dp i are shown in Figure [T] The two GG networks (dia¬ 
monds) separate clearly from the two MV networks (circles) 
either by considering the individual fc-order distances dp or 
the aggregate distance dpj. The distances between the two 
MV networks are high but still smaller than the distances 
between GG networks and MV networks. An unsupervised 
classification run across all four distances would assign all 
four networks correctly. 

The fc-order network distance dp is defined by searching 
for the correspondence such that the maximum fc-order prox¬ 
imity difference |r^(a;o:fe) — T'Y{y[)-.k)\ among all tuples of 
correspondents is minimized [cf. dUl and dH)]. For the optimal 
correspondence C* = argminj-^gg^jj-F^ y (C), define the 
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Fig. 8. Two dimensional Euclidean embeddings of the distances , d}p, , d-p between all quinquennial and biennial networks. In the embeddings, 

denote MV0408, MV0913 as circles, GG0408, GG0913 as diamonds, GG0405, GGObO?, GGlOll as up triangles and GG0809, GG1213 as down triangles. 
GG0809, GG1213 are colocated regarding d^,d^,dp i. GG0408 and GG0913 have identical coordinates in dip. 


pair of correspondent tuples that achieve the maximum A:-order 
difference as 

K:fc:2/o:fc)= argmax |r|(a:o:fc) - r^(yo:fe)| • (25) 

{xO:k,yO:k)&C* 

The tuple pair (a^Q.^., i/g.^,) is the bottleneck that prevents 
making the networks closer to each other. Examining these 
bottleneck pairs for each A:-order distance reveals what are the 
differences between proximity networks to which is most 
sensitive about. In general, A:-order bottleneck pairs tend to be 
pairs of tuples with high proximity values in their respective 
networks. The optimal correspondence C* map tuples with 
high proximity as closely as possible. Therefore, network 
distances are typically determined by large proximity values in 
one of the networks that can’t be matched closely to proximity 
values in the other network. 

In the quinquennial coauthorship networks of Figure |5] 
the bottleneck pair for 0-order distances d^, is formed by 
nodes with high zero order proximities and dp reflects the 
difference between their zero order proximities. Since the 
networks are normalized so that the lead nodes have size 1, dp 
is determined by their predominant coauthors, i.e., the scholars 
that collaborated most proliflcally with GG or VM during the 
period of interest. The distances dp between GG and VM 
networks are large because these predominant collaborations 
are different. In GG networks there are usually groups of 3 to 
5 predominant collaborators, whereas in MV networks there 
are usually one or two that concentrate a larger fraction of the 
total number of publications. 

Similarly, high first order proximity distances are likely 
due to one of the following situations: (i) Large differences 
between the numbers of papers authored by the predominant 
collaborators, (ii) Different patterns in the formation of com¬ 
munities - defined here as clusters of pairwise collaboration. 
In the latter case large distances arise because it is impossible 
to match the communities in one network to communities in 
the other. The distances dp between GG and MV networks 
are large because the latter contain a smaller number of 
communities, which are also more strongly connected than 
the communities in GG networks. 

In second order distances the bottleneck pair of triplets 
may reflect one of the following scenarios: (i) One network 
has collaboration between four or more authors while the 
other doesn’t, (ii) There exist three authors with a strong 
collaboration between them in one network whereas in the 


other network there does not exist collaboration between three 
authors or, if such collaboration exists, it is weak. Many papers 
written by MV are collaborations of three or four scholars 
and the predominant coauthor in MV networks appears in at 
least one collaboration of four scholars. For GG, his 2004- 
2008 network has a few collaborations consisting of four 
scholars however all such collaborations are weak. His 2009- 
2013 network has no publications written by four authors. 

B. Biennial networks 

The networks GG0408 and GG0913 have more nodes than 
the networks MV0408 and MV0913 prompting the possibility 
that the differences in distances discussed in Section IVI-BI are 
just due to their different number of publications. This is part 
of the reason, but not all. To see that this is true we consider the 
biennial GG collaboration networks. Each of these networks 
contain numbers of papers that are comparable to the number 
of papers in the quinquennial MV networks. 

Two dimensional Euclidean embeddings of the individual k- 
order distances dp for k € {0,1,2} and the aggregate distance 
dp 1 between the 4 quinquennial networks and the 5 biennial 
networks are shown in Figure[8l An unsupervised classification 
run across four distances would assign all nine networks 
correctly (dp, dp) or two of them incorrectly (dpjdp^). 

We expect more variation in biennial networks because 
the time for averaging behavior is reduced. E.g., we may 
see deviations from usual collaboration patterns due to the 
presence of exceptional doctoral students. Still, three of the 
biennial networks, GG0405, GG0607, GGlOll, (up triangles) 
and the two quinquennial networks GG0408, GG0913 (di¬ 
amonds) are close to each other in every metric used and 
form a cluster clearly separate from the two five-year networks 
MV0408 and MV0913 (circles). This is due to the fact that 
the distinctive features of GG coauthorship are well reflected 
in GG0405, GG0607, GGlOll. These features include: (i) 
Multiple predominant coauthors, each of whose collaboration 
with GG does not comprise a dominant portion of GG’s 
scholarship during the period, (ii) Multiple small coauthor¬ 
ship communities in which strong collaborations within each 
community are rare, (iii) The number of publications with four 
or more authors is low. These features contrast with the rather 
opposite properties of the MV networks. 

The networks GG0809 and GG1213 (down triangles) do not 
cluster nicely with the other five GG networks. Depending on 
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which distance we consider they may be closest to some of the 
other GG networks or to one of the two MV networks. This is 
because, likely due to random variation, GG0809 and GG1213 
have some features that resemble GG networks and some 
other features that resemble MV networks. Fundamentally this 
happens because of the exceptionally prolific collaborations 
with loannis Schizas (IS) in the 2008-2009 period and Gonzalo 
Mateos (GM) in the 2012-2013 period. In the network GG0809 
the IS node commands a significant fraction of GG publica¬ 
tions and creates strong links between collaboration clusters 
that would be otherwise separate. Both of these features are 
more characteristic of MV networks. In GG1213 network the 
GM node accounts for half of the publications in which GG is 
an author. This is, also, a feature more representative of MV 
networks than of GG networks. 

In summary, proximity network distances capture features 
of scholar collaboration that permit discerning networks of 
different authors even when we consider networks that have 
very different numbers of nodes. The zeroth order distance 
responds primarily to the number of predominant coauthors 
and the proportion of collaboration between predominant 
coauthors and the central scholar. The first order distance 
is mostly determined by the fraction of collaborations that 
involve predominant coauthors and the central scholar as well 
as the level and number of strong collaborations within each 
community in the group. The second order distance is 
largely given by the existence, level, and number of collab¬ 
orations between four or more scholars and the appearance 
of predominant coauthors in a collaboration between four or 
more scholars. 

Remark 3 The proposed metrics successfully identify the 
distinct collaborative behaviors of Prof. G. B. Giannakis and 
Prof. M. Vetterli from incomplete subsets of their publi¬ 
cation datasets. The distances between Giannkis’s networks 
(either quinquennial or biennial) are smaller than the distances 
between Giannakis’s networks and Vetterli’s networks. This 
proximity can be used in author name disambiguration or 
related problems, e.g., adjudicate the biennial networks to their 
rightful author if only the authors of the quinquennial networks 
are known. 

Remark 4 As a comparison, we applied some simple and 
reasonable methods to compare the corresponding pairwise 
networks of the coauthorship networks considered in this 
section. Motifs have been shown effective in distinguishing 
coauthorship networks from different scientific fields HD. 
To compare high order coauthorship networks by motifs, we 
restrict attention to pairwise relationships. The dissimilarities 
between coauthorship networks are assigned as the differences 
between the summations of the weighted motifs in their 
corresponding pairwise networks. Analysis based on triangle 
motifs (weighted) results in MV0408, MV0913, GG0408, and 
GG0809 being closer to each other and GG0913, GG0405, 
GG0607, GGlOll, and GG1213 being more proximate. 
Tetrahedron motif analysis (weighted) results in MV0408, 
MV0913, GG0408, GG0405, GG0607, and GG0809 being 
closer to each other and GG0913, GGlOll, and GG1213 


being more proximate. Other simple and common methods 
to compare pairwise networks yield similar results. Methods 
to compare pairwise networks via features give us similar 
observations as those based on the metric distances proposed 
in the paper. Notice that GG0408 and GG0913 are highly sim¬ 
ilar regarding the proposed network distances however their 
differences are relatively large in terms of feature comparisons. 

VII. Conclusion 

We have considered high order networks as a generalization 
of conventional pairwise networks and discussed the defini¬ 
tion of valid metrics to enable their comparison. High order 
networks satisfy the specification of degeneracy relations that 
relationship function within a tuple of repeating elements is 
identical to the relationship within its largest subtuple with 
unique elements. The table in Figure |9] summarizes the results 
derived in this paper. The fundamental definitions are those of 
the fc-order network differences introduced in Definition |7] and 
the p-norm difference introduced in Definition |9] Proposition 
[T] proves that the distances x —>■ R+ are pseu¬ 

dometrics in the space of networks modulo fc-isomorphism. 
Proposition |2] shows that dj>j-^p : x K.+ is a 

pseudometric in the space of networks modulo isomorphism. 

We also introduced the space of dissimilarity networks 
of order K in Definition [TO] and the space of proximity 
networks in Definition [T3] Dissimilarity networks also satisfy 
the order increasing property whereby tuples become more 
dissimilar when members are added to the group. Proximity 
networks abide to the order decreasing property whereby 
tuples becomes less similar when adding nodes to the group. 

When restricted to the space of dissimilarity networks the 
distance d^ : x —>■ R+ is termed the fc-order 

dissimilarity network distance [cf. Definition fTTIl and the 
distance ^ x R+ is termed the p-norm 

dissimilarity network distance [cf. Definition (TT) ■ We proved 
that the fc-order dissimilarity network distance is a metric in 
the space mod =k of dissimilarity networks modulo 
fc-isomorphism for any integers fc > 1 [cf. Theorem [T] and 
that the p-norm dissimilarity network distance is a metric 
in the space mod = of dissimilarity networks modulo 
isomorphism [cf. Theorem ID. Analogous results hold true 
for proximity networks as summarized in the last column of 
the table in Figure |9] and spelled out in Definitions [14] and 
[TSl and Theorems |3] and |4| We have also shown that the p- 
norm \\d^{N^, of the vector that groups the fc-order 

differences d\f{N^, ^y) lower bounds the p-norm difference 
dj^,p{N^, NP) [cf. Proposition 13 • This property is inherited 
when we restrict attention to proximity and dissimilarity 
networks as summarized in the bottom row of the table in 
Figure |9] 

Proximity and dissimilarity networks are equivalent con¬ 
structions as it follows formally from the notion of duality 
introduced in Definition [T6| We have shown that this duality 
extends to the various distances defined in the sense that prox¬ 
imity distances between two proximity networks is the same as 
the dissimilarity distances between their corresponding duals 

[cf. Proposition |4l • 
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Input space 

i^-order networks, A/*^ 

A'-order dissimilarity networks, T>^ 

if-order proximity networks, 

fe-order difference 

X —)■ M 4 - [cf. Definition 0 

Pseudometric in mod 

X ^ E-f- [cf. Definition 

Metric in T>^ mod for fc > 1 

d^ : X ^ E+ [cf. Definition 

Metric in mod for fc > 1 

p-norm difference 

dM,p : X —>■ M+ [cf. Definition!^ 

Pseudometric in mod = 

dT),p ■. X T)^ —y R-f [cf. Definition 1121 

Metric in mod = 

d-p,p : X —>■ E-f- [cf. Definition^] 

Metric in mod ^ 

Relationships 

dMAN^,N^) > ||d^(Arf .Arf)||p 

d-uAD^.D^) > ||dg(Df,Df)||p 

d'p,p{Px ’ Pp) ^ 11 *^^{Px : Py)\\p 


Fig. 9. Relationships between the spaces of high order networks, dissimilarity networks, and proximity networks. A family of pseudometrics can be defined 
to measure dissimilarities between a specific order functions between high order networks. Another family of pseudometrics can be defined to quantify 
distinctions between high order networks across all order functions. These two families of pseudometrics are related and become metiics in the con'esponding 
spaces when we restrict attentions to dissimilarity networks or proximity networks. 


We illustrated the value of our definitions by using prox¬ 
imity network distances to successfully identify collaboration 
patterns of Prof. Georgios B. Giannakis and Prof. Martin Vet- 
terli. With respect to future goals the most important limitation 
in the current manuscript is that distances are difficult to 
compute when the number of nodes in the network is large. 
For networks with large number of nodes it is necessary to 
develop tools for approximate evaluation of network distances. 
These tools exist for the comparison of metric spaces and their 
generalization to networks is part of ongoing research. The 
idea is to relate high order dissimilarity networks to simplicial 
complexes and filtrations so that distances between networks 
can be lower bounded or reasonably approximated by the dif¬ 
ference between persistence homologies of the corresponding 
filtrations lf3^ . 1331. 

Appendix A 

Proof of Proposition[T| 

To prove that for any integer 0 < A: < AT is a 
pseudometric in the space of iT-order networks modulo k- 
isomorphism we prove the (i) nonnegativity, (ii) symmetry, 
(iii’) relaxed identity, and (iv) triangle inequality properties in 
Definition [2 

Proof of nonnegativity property: For any integers 0 < 

k < K, since |r^(a;o:fe) - r^(yo:fc)| is nonnegative r|-_y(C') 
defined in ® also is. The network distance must then satisfy 
T ^y) — 0 because it is a minimum of nonnegative 
numbers. ■ 

Proof of symmetry property: A correspondence C C X xY 
with elements Ci = {xi,yi) results in the same associations as 
the correspondence C C Y x X with element Ci = {yi,Xi). 
Thus, for any correspondence C and integers 0 < k < K, we 
have a correspondence C such that r^y(C') = TyxiC)- It 
follows that the minima in (|9l must coincide from where it 
follows that , NP) = d%-{NP, N^). U 

Proof of relaxed identity property: We need to show 
that for any integers 0 < fc < AT if Nx and Ny are k- 
isomorphic we must have d^{Nx, Ny) = 0. To see that 
this is true recall that for fc-isomoiphic networks there exists 
a bijection cj) : X ^ Y that preserves distance functions at 
order k [cf. O]. Consider then the particular correspondence 
= {(x, 4>{x)), X € X}. For all xq G X there is an element 
c = (xo ,y) G C(j) and for all yo G Y there is an element 
c' = {x,yo) G since (p is bijective. Thus C^j, is a valid 


correspondence between X and Y for which (|7l indicates that 
it must be 

ryivo-.k) = ry{(j){xo:k)) = rx{xo,k), (26) 

for any {xo:k,yo:k) G C^. This implies r^y(C') = 

\rxixo-.k) - ry(yo:fc)| = 0 for any {xo:k,yo-.k) G C^. Since 
is a particular correspondence, taking a minimum over all 
correspondences as in (|9]l yields 

<r^,y(C')=0. (27) 

Since d^{Nx ,Ny) > 0, as already shown, it must be that 
d^{Nx , Ny) = 0 when Nx and Ny are fc-isomorphic. ■ 

Proof of triangle inequality: To show that the triangle 
inequality holds, let the correspondence Ci between X and 
Z and the correspondence C2 between Z and Y be the 
minimizing correspondences in (|9]l. We can then write 

d^(ivf,7V#)=r|_y(C2). 

(28) 

Define a correspondence C between X and Y as the one 
induced by pairs (x^z) and {z,y) sharing a common node 
z G Z, 

C := {ix,y) \BzG Z with {x,z) G Ci,{z,y) G C2} ■ (29) 

To show that C is a well defined correspondence we need to 
show that for every x G X there exists yo G Y such that 
ix,yo) G C and by symmetry for every y GY there exists 
xq G Y such that {xo,y) G C. To see this, first pick an 
arbitrary x G X. Because Ci is a correspondence between 
X and Z there must exist zq G Z such that (x, zq) G Ci. 
There must exist yo G Y such that (zo,2/o) G C 2 since 
C2 is also a correspondence between Y and Z. Therefore, 
there exists a pair {x,yo) G T with yo G Y for any 
X G X. The second part follows by symmetry and C is a well 
defined correspondence. The correspondence C may not be the 
minimizing correspondence for the distance d^{Nx, Ny). 
However since it is a valid correspondence with the definition 
in we can write 

d'^iN^,NP)<r’k,yiC). ( 30 ) 

By the definition of C in ( |29] |. the requirement {xo-,k,yo-.k) G C 
is equivalent as (xorfc, zo:fe) S Ci and (zo:fc, Vo-.k) G C2 for any 
0 < k < K. Further adding and subtracting r^{zo-k) in the 
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absolute value ofr^y,(C') = |r^(a:o:fc)~?'y(yo:fc)| and using 
the triangle inequality of the absolute value yields 

T\y{C)< max {\r\{xo,k) - r%{zo,k)\ 

{_XQ,k,ZQ,k)^C-l ^ 

(•20;fc ,2/0:fe)€C2 

+ \r%{zo,k) - ?'y(2/0:fc)|}- 

(31) 

We can further bound ( ISTT i by taking maximum over each 
summand, 

^xy{C) < max |r^(a;o:/c) - ?'|(- 2 :o:fc)| + 

(rC0:fe,20;fc)eCl 

max |?’z(^0:fe) —fr(yo:fe)|=rx,z(C'i)+r|,y(C'2)- 

{zo-.k,yo.k)&C 2 

(32) 

Substituting (l30l l and (l28T l into (l32l l yields triangle inequality. 


Appendix B 

Proof of Proposition!!] 

To prove that dx'^p is a distance in the space of AT-order 
networks modulo isomorphism we prove the (i) nonnegativity, 
(ii) symmetry, (iii’) relaxed identity, and (iv) triangle inequality 
properties in Definition [3 

Proof of nonnegativity property: Since ||r^^(C')||p > 0, 
the network distance must then satisfy dj\f^p{N^, Np) > 0 
as it is a minimum of nonnegative numbers. ■ 

Proof of symmetry property: A correspondence C C X xY 
with elements Ci = {xi,yi) results in the same associations as 
the correspondence C C Y x X with element Ci = {yi,Xi). 
Thus, for any correspondence C we have a correspondence C 
such that T^yi^) = This implies ^((7)11^ = 

l|Tyx(C')||p- It follows that the minima in (fTTTi must coincide 
and therefore dx-^p{N^, N^) = dx-^p{NP, N^). ■ 

Proof of relaxed identity property: We need to show that if 
and Ny are isomorphic we must have djy^p{N^^ ^y) — 
0. To see that this is true recall that for isomorphic networks 
there exists a bijection (p : X ^ Y that preserves distance 
functions at every order [cf. (|7]i]. Consider then the partic¬ 
ular correspondence = {{x^ (j){x)), x G X}. We have 
demonstrated in Appendix lAl that is a valid correspondence 
between X and Y. The definition of isomorphism indicates 
that it must be (l2&l i holds true for all 0 < fc < AT and 
{xo-.k,yo:k) G C(j). Since is a particular correspondence, 
from (fTTT i it follows that 

dAr.p(iVf,Arf)< ||rf,y(C)||^. (33) 

Because r^{xo-,k) — ry{yo-.k) = 0 for any 0 < k < K 
and any (xqi/c, yo:fc) G by (|26l), we have T^ y{C) = 0. 
II • lip being a proper norm implies l|Tx,r(T^)llp — O' 
stituting this back into (l33l l shows dj\f^p{N^,Ny) < 0. 
Since djy^p{N^, Np) > 0, as already shown, it must be that 
dj\f.p{Np,Np) = 0 when Np and are isomorphic. ■ 

Proof of triangle inequality: To show that the triangle 
inequality holds, let the correspondence Ci between X and 


Z and the correspondence C 2 between Z and Y be the 
minimizing correspondences in (fTTIi . We can then write 

d^,piNP,NP) = \\rlziCi)\\ 

^ (34) 

d^,piNp,NP) = \\TPyiC2)\\^. 

Define a correspondence C between X and Y in the same 
way as (|2^ . We have demonstrated in Appendix lAl that C is 
a well defined correspondence. Therefore with the definition 
in we can write 

d^,piNP,NP)<\\rlyiC)\\^. (35) 

Moreover, in Appendix lAlwe also showed for any 0 < k < K, 

rly(C)<r^,^(Ci) + r|^^(C'2). (36) 

This implies the vector Tp ^{Ci)+ Tpy{C 2 ) is elementwise 
no smaller than the vector y{C). The definition of p-norm 

ll^llp “ (Sfco guarantees that the value of ||x||p 

is monotonically nondecreasing on each element Xi in x = 
{xq, xi,..., Xnp. Therefore, 

\\TPYiC%<\\TPziCi) + T%^yiC2)\l. (37) 

We can further bound (ITtT i by using the triangle inequality of 
the p-norm, 

||rlv(c)||p < \\^x,ziCi)\\p + ||r|.y(C2)||^. (38) 

Substituting dTSl l and (l34l l back into (l38l l yields the triangle 
inequality. ■ 

Appendix C 

Proofs in SECxiON riV-AI 

Proof of Theorem [TJ The proof in Appendix has demon¬ 
strated dfpi is a pseudometric in the space mod =fc. To 
prove that dp is a metric in the same space we need to show 
the missing part in the (iii) identity property in Definition [3 
Proof of the second part of the identity property: We 
want to prove dp{DP,DP) = 0 must imply that and 

are fc-isomorphic. If d^{DP, Dp) = 0, there exists a 
correspondence C such that rp{xo-,k) = ry{yo.k) for any 
{xo-.k,yo:k) G C. Define a function (j> : X ^ Y that associates 
X with an arbitrary y chosen from the set that form a pair with 
X in C, 

(p : X 1 -^ yo G {y \ {x,y) G C}. (39) 

Since C is a correspondence the set {y \ (x,y) G C} is 
nonempty for any x implying that p is well-defined for any 
X G X. Therefore rp{xo.,k) = ((^(a;o:fe)) for any xo.,k- This 

implies the function p must be injective. If it were not, there 
would be a pair of nodes x p x' with p{x) = p{x') = y for 
some y gY. Hence the Ic-order relationship function between 
{x,... ,x, x') where the first k — 1 nodes in the tuple are x 
and the last node is x' would satisfy 

rp{x...,x,x') = rfp{p{x,...,x,x')) = rp{y,... ,y), (40) 
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follows from the definition of (p. The fc-order relationship 
between the tuple (x,... ,x) where all the k nodes are 
identical would also satisfy 

r\{x,...,x)= ryiPix,... ,x)) = Tyiv,.. ■ ,y). (41) 

Combining ( |40] | and (HTI) yields 

= r^(a;, (42) 

Meanwhile, the identity property for high order networks [cf. 
Definition 13 implies 

rxix,...,x,x') =rxix,x'), r^(x, ...,x) =r^(x). 

(43) 

Using the fact that for dissimilarity networks, relationship 
functions are the summations of dissimilarity functions and 
the multiplication of e and ranks, we have that 

rx{x,x') = dx{x,x') +2€, rx{x) = dx{x) + e. (44) 

Moreover, the order increasing property for dissimilarity func¬ 
tions implie 

dxix,x') >dx{x). (45) 

Substituting the decompositions (l44l l and (|45] | into (l43t yields 

rxix,...,x,x') > rxix,...,x). (46) 

which contradicts with (l42l l and shows that (p must be injective. 

Likewise, define the function ip : Y ^ X that associates y 
with an arbitrary x chosen from the set that form a pair with 

y in C, 

Ip :y^ xo€ {x|(x, y) € C}. (47) 

It follows by similar arguments that ip must be injective. 
By applying the Cantor-Bernstein-Schroeder theorem ll54l 
Section 2.6] to the reciprocal injections p : X ^ Y and 
Ip :Y X, the existence of a bijection between X and Y is 
guaranteed. This forces X and Y to have same cardinality 
and p and ip being bijections. Pick the bijection p and it 
follows r^(xo:fe) = fy ((^(xo:/c)) for all nodes (fc + l)-tuples 
xo:fc € This shows that Dx —k Dy completes the 

proof of the identity statement. ■ 

Having demonstrated all four properties in Theorem [1] the 
global proof completes. ■ 

Proof of Theorem |2l The proof in Appendix |B] has demon¬ 
strated that d-D^p is a pseudometric in the space mod =. 
To prove that du p is a metric in the same space we further 
demonstrate the missing part in the (iii) identity property in 
Definition |3 

Proof of the second part of the identity property: We want 
to show dT>,p{Dx, Dy) = 0 implying and Dy being iso¬ 
morphic. If dv,p{D^,D^) = mincec(x.r) l|rf^v(C')llp = 
0, there exists a correspondence C such that 

l|riy(C)||p = 0 . (48) 

The property of p-norm implies that this correspondence C 
satisfies r^y(C') = 0 for 0 < k < K, i.e. r^(xo:fc) = 
ryivo-.k) for any 0 < fc < AT and (xo:/c, 2 / 0 :fe) S C. Define 


functions p : X ^ Y as in (l39T l and ip : Y ^ X as in ( l47T i. 
The analysis in Proof of Theorem [T] has demonstrated that p 
and Ip are bijections and that X and Y have same cardinality. 
Pick the bijection p and it follows r\{xo:k) = rY{p{xo-,k)) 
for any 0 < k < K and all (fc + l)-tuples xo:fc € X. This 
shows that — Dy and completes the proof of the identity 
statement. ■ ■ 

Appendix D 

Proofs in SECTiON rv-AI 

Proof of Theorem |3 : The proof in Appendix has 
demonstrated that dfp is a pseudometric in the space 
mod =k- To prove that d!p is a metric in the same space 
we need to show the missing part in the (iii) identity property 
in Definition [3] 

Proof of the second part of the identity property: Most 
parts of the proof follow from the proof of the second part 
of the identity property for Theorem [T] in Appendix O The 
only difference is in demonstrating the function p constructed 
in ( |39] | is injective. Under the same setup where there exist a 
pair of nodes x 7 ^ x' such that p{x) = p{x') = y for some 
y GY, the fc-order relationship between (x,..., x, x') would 
satisfy 

r^(x... ,x,x') = r^iy, ...,y)= r'^ix.. . ,x). (49) 

Meanwhile, the facts of proximities in proximity networks 
follow order decreasing property px{x,x') < Px(x) and 
r\{x,x') = Px{x,x') — 2e,r^(x) = Px{x) — e from (fT9] l 
implies 

rj^{x,x') <r]^(x). (50) 

Combining (l50l l with the identity property inherited from 
high order networks [cf. Definition |5l r^(x,..., x, x') = 
rx{x,x'),r^{x, ...,x) = r]^{x) gives us 

r^(x,...,x,x') <r^(x,...,x), (51) 

which contradicts with (|49] | and shows that p must be injective. 
The rest of the proof follows. ■ ■ 

Proof of Theorem |4j The proof in Appendix |B] has demon¬ 
strated that d-p p is a pseudometric in the space mod =. 
To prove that dp^p is a metric in the same space we further 
demonstrate the missing part in the (iii) identity property in 
Definition |3] 

Proof of the second part of the identity property: We 

want to show that having dp^p{Px , Py) = 0 must imply 
that Px being isomorphic to Py. If dp^p^Px , Py) — 0, 
there exists a correspondence C such that ||r^Y(C')||p = 0. 
The property of p-norm implies that this correspondence C 
satisfies r\{xQ:k) = ry{yo-.k) for any 0 < k < K and any 
ixQ-,k, yo:k) G C. Define functions p : X ^ Y as in (|39) and 
Ip : Y ^ X as in (l47l i. the analysis in Appendix m Proof 
of Theorem |3] has demonstrated that p and ip are bijections 
and that X and Y have same cardinality. Pick the bijection p 
and it follows r^(xo:fe) = T'y{p(xQ:k)) for any 0 < k < K 
and xo:fc G X. This shows that P^ = Pp and completes the 
proof of the identity statement. ■ ■ 




15 


Appendix E 

Proofs in SECTioN rv-BI 

Proof of Proposition |4] : We first prove ( |2^ by considering 
proximity networks and Pp and their corresponding dual 
dissimilarity networks and . Let the correspondence 
C between X and Y be the minimizing correspondence in 
dp{PP,PP) [cf. Definition fUl so that we can write 

4(^#.^#) = r^x.Pv(c)- (52) 


C may not be the minimizing correspondence for the distance 
dp{DP, Dp) [cf. Definition [TT], but since it is a valid 
correspondence, it holds true that 

dUDP,DP)<T'h^^j,^{C). ( 53 ) 

From the definition of duality [cf. (i22l i1. we may write 


{XO:k,yO:k)^C 


(l-d^(a;o:/c))-(l-dy(j/ 0 :fc)) 


The ones in ( [54l i cancel out and therefore, 

r^x,Dv(^) = r^x.Pv(^)- 

Substituting (iSZt and (fSTt back to (i55T l implies 

dUP^,PP)>dp{Dp,DP). 


(54) 


(55) 


( 56 ) 


Let the correspondence C between X and Y be the minimiz¬ 
ing correspondence in dp{Dp, Dp). Then C is also a valid 
correspondence for the distance dP{Pp, Pp). By symmetry, 
we have 


dUDP,DP)>dpiPP,PP). (57) 

Combining ([56l l and (fSTl i yields the desired result in (i2Tt . 

Next we prove (i24l i by considering PP and PP and their 
corresponding duals D^ and Dp. Let the correspondence 
C between X and Y be the minimizing correspondence in 
dv.piPp^ Pp) [‘'f- Definition [121 so that we can write 

dvAPLPY) = \K.,pAC)\\^- (58) 

C may not be the minimizing correspondence for the distance 
dv,p{Dp, Dp) [cf. Definition [121, but again since it is a valid 
correspondence, we may write 

dvADx,DP)<\\rP^^P,^{C)\\^. (59) 

We have demonstrated in proving (i23t that for any integers 
0 < k < K, Tp^ jj^{C) = TA p^{C). In vector form, 
this is ~ Therefore, the property of 

p-norm implies that 

\\rP,^pAC)\l = \\P^x.PYic)\\,- ( 60 ) 

Substituting (i58l l and ([59] l back to (i60l l yields 

dvAPp,PP) > dv.piDp.DP). ( 61 ) 

Let the correspondence C between X and Y be the minimiz¬ 
ing correspondence in dxi.piDP, Dp ). Then C" is also a valid 
correspondence for d'Pp{PP, PP). By symmetry, we have 

dv.p{DP,DP) > dr,piPP,PP). (62) 

Combining ( IMT l and ( i62] i yields the desired result in (i24l i. ■ 
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